Collaborative search engine

Collaborative Search Engines (CSEs) are an emerging trend for Web search and Enterprise search within company intranets. CSEs let users concert their efforts in information retrieval (IR) activities, share information resources collaboratively using knowledge tags, and allow experts to guide less experienced people through their searches. Collaboration partners do so by providing query terms, collective tagging, adding comments or opinions, rating search results, and links clicked of former (successful) IR activities to users having the same or a related information need.

Contents

Models of collaboration

Collaborative search engines can be classified along several dimensions: intent (explicit and implicit) and synchronization [1] and depth of mediation [2], task vs. trait [3], and division of labor and sharing of knowledge [4].

Explicit vs. implicit collaboration

Implicit collaboration characterizes Collaborative filtering and recommendation systems in which the system infers similar information needs. I-Spy [5], Jumper 2.0, Seeks, the Community Search Assistant[6], the CSE of Burghardt et al.[7], and the works of Longo et al. [8] [9] [10] all represent examples of implicit collaboration. Systems that fall under this category identify similar users, queries and links clicked automatically, and recommend related queries and links to the searchers.

Explicit collaboration means that users share an agreed-upon information need and work together toward that goal. For example, in a chat-like application, query terms and links clicked are automatically exchanged. The most prominent example of this class is SearchTogether[11] published in 2007. SearchTogether offers an interface that combines search results from standard search engines and a chat to exchange queries and links. Reddy et al.[12] (2008) follow a similar approach and compares two implementations of their CSE called MUSE and MUST. Reddy et al. focuses on the role of communication required for efficient CSEs. Representatives for the class of implicit collaboration are I-Spy[5], the Community Search Assistant[6], and the CSE of Burghardt et al.[7]. Cerciamo [2] supports explicit collaboration by allowing one person to concentrate on finding promising groups of documents, while having the other person make in-depth judgments of relevance on documents found by the first person.

However, in Papagelis et al.[13] terms are used differently: they combine explicitly shared links and implicitly collected browsing histories of users to a hybrid CSE.

Community of practice

Recent work in collaborative filtering and information retrieval has shown that sharing of search experiences among users having similar interests, typically called a community of practice or community of interest, reduces the effort put in by a given user in retrieving the exact information of interest.[14].

Collaborative search deployed within a community of practice deploys novel techniques for exploiting context during search by indexing and ranking search results based on the learned preferences of a community of users.[15]. The users benefit by sharing information, experiences and awareness to personalize result-lists to reflect the preferences of the community as a whole. The community representing a group of users who share common interests, similar professions. The best known example is the open-source project Jumper 2.0 [16].

Depth of mediation

This refers to the degree that the CSE mediates search.[2] SearchTogether[11] is an example of UI-level mediation: users exchange query results and judgments of relevance, but the system does not distinguish among users when they run queries. Cerchiamo[2] and recommendation systems such as I-Spy[5] keep track of each person's search activity independently, and use that information to affect their search results. These are examples of deeper algorithmic mediation.

Task vs. trait

This model classifies people's membership in groups based on the task at hand vs. long-term interests; these may be correlated with explicit and implicit collaboration.[3]

Privacy-aware collaborative search engines

Search terms and links clicked that are shared among users reveal their interests, habits, social relations and intentions[17]. In other words, CSEs put the privacy of the users at risk. Studies have shown that CSEs increase efficiency [11][18] [19] [20]. Unfortunatelly, by the lack of privacy enhancing technologies, a privacy aware user who wants to benefit from a CSE has to disclose his entire search log. (Note, even when explicitly sharing queries and links clicked, the whole (former) log is disclosed to any user that joins a search session). Thus, sophisticated mechanisms that allow on a more fine grained level which information is disclosed to whom are desirable.

As CSEs are a new technology just entering the market, identifying user privacy preferences and integrating Privacy enhancing technologies (PETs) into collaborative search are in conflict. On one hand, PETs have to meet user preferences, on the other hand one cannot identify these preferences without using a CSE, i.e., implementing PETs into CSEs. Today, the only work addressing this problem comes from Burghardt et al.[21] They implemented a CSE with experts from the information system domain and derived the scope of possible privacy preferences in a user study with these experts. Results show that users define preferences referring to (i) their current context (e.g., being at work), (ii) the query content (e.g., users exclude topics from sharing), (iii) time constraints (e.g., do not publish the query X hours after the query has been issued, do not store longer than X days, do not share between working time), and that users intensively use the option to (iv) distinguish between different social groups when sharing information. Further, users require (v) anonymization and (vi) define reciprocal constraints, i.e., they refer to the behavior of other users, e.g., if a user would have shared the same query in turn.

References

  1. ^ Golovchinsky Gene, Pickens Jeremy (2007), "Collaborative Exploratory Search", Proceedings of HCIR 2007 workshop, http://projects.csail.mit.edu/hcir/web/hcir07.pdf 
  2. ^ a b c d Pickens Jeremy, Golovchinsky Gene, Shah Chirag, Qvarfordt Pernilla, Back Maribeth (2008), Collaborative Exploratory Search, pp. 315–322, doi:10.1145/1390334.1390389, http://portal.acm.org/citation.cfm?id=1390389 
  3. ^ a b Morris Meredith, Teevan Jaime (2008), Understanding Groups’ Properties as a Means of Improving Collaborative Search Systems, http://workshops.fxpal.com/jcdl2008/submissions/tmpDF.pdf 
  4. ^ Foley Colum (2008), Division of Labour and Sharing of Knowledge for Synchronous Collaborative Information Retrieval, http://www.computing.dcu.ie/~cfoley/cfoley-PhD_thesis.pdf 
  5. ^ a b c Barry Smyth, Evelyn Balfe, Peter Briggs, Maurice Coyle, Jill Freyne (2003), "Collaborative Web Search", IJCAI: 1417–1419 
  6. ^ a b Natalie S. Glance (2001), "Community search assistant", Workshop on AI for Web Search AAAI'02 
  7. ^ a b Thorben Burghardt, Erik Buchmann, Klemens Böhm (2008), "Discovering the Scope of Privacy Needs in Collaborative Search", Web Intelligence (WI), http://www2.computer.org/portal/web/csdl/doi/10.1109/WIIAT.2008.165 
  8. ^ Longo Luca, Barrett Stephen, Dondio Pierpaolo (2009), "Toward Social Search - From Explicit to Implicit Collaboration to Predict Users' Interests", WEBIST 2009 - Proceedings of the Fifth International Conference on Web Information Systems and Technologies, Lisbon, Portugal, March 23-26, 2009 1: 693–696, ISBN 978-989-8111-81-4 
  9. ^ Longo Luca, Barrett Stephen, Dondio Pierpaolo (2010), "Enhancing Social Search: A Computational Collective Intelligence Model of Behavioural Traits, Trust and Time", Transaction Computational Collective Intelligence II 2: 46–69, doi:10.1007/978-3-642-17155-0_3, http://www.springerlink.com/content/e12233858017h042/ 
  10. ^ Longo Luca, Barrett Stephen, Dondio Pierpaolo (2009), "Information Foraging Theory as a Form of Collective Intelligence for Social Search", Computational Collective Intelligence. Semantic Web, Social Networks and Multiagent Systems, First International Conference, ICCCI 2009, Wroclaw, Poland, October 5-7, 2009. Proceedings 1: 63–74, ISBN 978-3-642-04440-3, http://dl.acm.org/citation.cfm?id=1692026 
  11. ^ a b c Meredith Ringel Morris, Eric Horvitz (2007), "SearchTogether: An Interface for Collaborative Web Search", UIST, http://portal.acm.org/citation.cfm?id=1294211.1294215 
  12. ^ Madhu C. Reddy, Bernhard J. Jansen, Rashmi Krishnappa (2008), "The Role of Communication in Collaborative Information Searching", ASTIS 
  13. ^ Athanasios Papagelis, Christos Zaroliagis (2007), "Searchius: A Collaborative Search Engine", ENC '07: Proceedings of the Eighth Mexican International Conference on Current Trends in Computer Science: 88–98, doi:10.1109/ENC.2007.34, http://portal.acm.org/citation.cfm?id=1302894 
  14. ^ Rohini U, Vamshi Ambati (2002), "A Collaborative Filtering based Re-ranking Strategy for Search in Digital Libraries", ICADL2005: the 8th International Conference on Asian Digital Libraries, http://www.aaai.org/Papers/Workshops/2006/WS-06-10/WS06-10-004.pdf 
  15. ^ Maurice Coyle and Barry Smyth (2008), Nejdl, Wolfgang; Kay, Judy; Pu, Pearl et al., eds., "Social Aspects of a Collaborative, Community-Based Search Network", Adaptive Hypermedia and Adaptive Web-Based Systems, Volume 5149/2008: 103–112, doi:10.1007/978-3-540-70987-9, ISBN 978-3-540-70984-8, http://portal.acm.org/citation.cfm?id=1485050 
  16. ^ Jumper Networks Inc (2010), "Jumper Networks Releases Jumper 2.0.1.5 Platform with New Community Search Features", Press release, http://www.jumpernetworks.com/documentation.html 
  17. ^ Data Protection Working Party (2008), "Article 29 EU Data Protection Working Party", EU 
  18. ^ Barry Smyth, Evelyn Balfe, Oisin Boydell, Keith Bradley, Peter Briggs, Maurice Coyle, Jill Freyne (2005), "A Live-User Evaluation of Collaborative Web Search", IJCAI 
  19. ^ Smyth,, Barry and Balfe,, Evelyn (2005), "Anonymous personalization in collaborative web search", Inf. Retr. 9 (2): 165–190, doi:10.1007/s10791-006-7148-z 
  20. ^ Seikyung Jung, Juntae Kim, Herlocker, J.L. (2004), "Applying Collaborative Filtering for Efficient Document Search", Inf. Retr.: 640–643 
  21. ^ Thorben Burghardt, Erik Buchmann, Klemens Böhm, Chris Clifton (2008), "Collaborative Search And User Privacy: How Can They Be Reconciled?", CollaborateCom, http://dbis.ipd.uni-karlsruhe.de/1184.php